Abstract

Bats provide key ecosystem services such as crop pest regulation, pollination, seed dispersal, 
and soil fertilization. Bats are also major hosts for biological agents responsible for zoonoses, 
such as coronaviruses (CoVs). The islands of the Western Indian Ocean are identified as a major 
biodiversity hotspot, with more than 50 bat species. In this study, we tested 1,013 bats belonging 
to 36 species from Mozambique, Madagascar, Mauritius, Mayotte, Reunion Island and Seychelles,
based on molecular screening and partial sequencing of the RNA-dependent RNA polymerase
gene. In total, 88 bats (8.7%) tested positive for coronaviruses, with higher prevalence
in Mozambican bats (20.5% ± 4.9%) as compared to those sampled on islands (4.5% ± 1.5%).
Phylogenetic analyses revealed a large diversity of α- and β-CoVs and a strong signal of
co-evolution between CoVs and their bat host species, with limited evidence for host-switching,
except for bat species sharing day roost sites.


Importance 
This is the first study to report the presence of coronaviruses (CoVs) in bats in Mayotte,
Mozambique and Reunion Island, and in insectivorous bats in Madagascar. Eight percent of the
tested bats were positive for CoVs, with higher prevalence in continental Africa than on islands.
A high genetic diversity of α- and β-CoVs was found, with strong association between bat host
and virus phylogenies, supporting a long history of co-evolution between bats and their
associated CoVs in the Western Indian Ocean. These results highlight that strong variation between
islands does exist and is associated with the composition of the bat species community on each
island. Future studies should investigate whether CoVs detected in these bats have a potential
for spillover in other hosts.


Keywords: bat, coronavirus, islands, tropical, evolution, ecology 
Introduction 

The burden of emerging infectious diseases has significantly increased over the last decades 
and is recognized as a major global health concern. In 2018, the World Health Organization 
(WHO) established the “Blueprint priority disease list”, identifying viruses such as Ebola, Lassa 
fever, Middle East Respiratory Syndrome (MERS), and Nipah fever as significant threats to 
international biosecurity 1. This list also highlights the potential pandemic risk from the
emergence of currently unknown zoonotic pathogens, collectively referring to these unknown threats
as “disease X” 1. Investigation of the potential zoonotic pathogens in wild animals, particularly
vertebrates, is thus critical for emerging infectious disease preparedness and responses. 

Bats represent nearly 1,400 species and live on all continents except Antarctica 7. They
provide key ecosystem services such as crop pest regulation, pollination, seed dispersal, and
soil fertilization. Bats are also recognized as reservoirs of many zoonotic pathogens,
including coronaviruses (CoVs). Indeed, several CoVs originating from bats have emerged in
humans and livestock with sometimes major impacts to public health. For instance, in 2003, the
Severe Acute Respiratory Syndrome (SARS) CoV emerged in humans, after spillover from bats
to civets, and led to the infection of 8,096 people and 774 deaths in less than a year.

Our study area spans geographic locations across the islands of the Western Indian
Ocean and southeastern continental Africa (SECA) (Figure 1). These islands have diverse
geological origins that have influenced the process of bat colonization and species distributions
20. The ecological settings and species diversity on these islands for bats are notably different.
On Madagascar, more than 45 bat species are known to occur, of which more than 80% are
endemic to the island. The smaller studied islands of the Western Indian Ocean, Mauritius,
Mayotte, Reunion Island, and Mahé (Seychelles), host reduced bat species diversity (e.g. three
species on Reunion Island), whereas SECA supports a wide range of bat species. To date,
several studies have identified bat-infecting CoVs in countries of continental Africa, including
Zimbabwe, South Africa, Namibia, and Kenya. CoVs have also been reported in
fruit bats (Pteropodidae) in Madagascar, where β-coronaviruses belonging to the D-subgroup
were identified in Eidolon dupreanum and Pteropus rufus.

In this study, we investigated the presence of CoVs in over 1,000 individual bats
belonging to 36 species, sampled on five islands (Madagascar, Mauritius, Mayotte, Reunion
Island, and Mahé) and one continental area (Mozambique). Based on molecular screening and
partial sequencing of the RNA-dependent RNA polymerase gene, we (i) estimated CoV
prevalence in the regional bat populations, (ii) assessed CoV genetic diversity, and (iii) identified
associations between bat families and CoVs, as well as potential evolutionary drivers leading
to these associations.


Results 
Prevalence of CoV 

A total of 1,013 bats were tested from Mozambique, Mayotte, Reunion Island,
Seychelles, Mauritius and Madagascar (Figure 1). In total, 88 of the 1,013 bat samples tested
positive for CoV by Real-Time PCR (mean detection rate: 8.7%). The prevalence of positive bats
differed among sampling locations (χ² = 77.0, df = 5; p < 0.001), with a higher
prevalence in Mozambique (± 95% confidence interval: 20.5% ± 4.9%) than on all Western
Indian Ocean islands (4.5% ± 1.5%) (Figure 2). A significant difference in the prevalence of
positive bats was also detected between families (χ² = 44.8, df = 8; p < 0.001; Supplementary
Figure S1). The highest prevalences were observed in the families Nycteridae (28.6% ± 23.6%)
and Rhinolophidae (26.2% ± 11.0%). Bat species had a significant effect on the probability of
CoV detection (χ² = 147.9, df = 39; p < 0.001; Supplementary Figure S2). Finally, the
prevalence of CoV-positive bats in Mozambique differed significantly (N = 264, χ² = 22.8,
df = 1; p < 0.001; Supplementary Figure S3) between February (37.4% ± 9.9%) and May
(11.6% ± 4.8%).
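As an aid to reading the prevalence figures above, the following minimal sketch computes a detection rate with a normal-approximation (Wald) 95% confidence interval. The overall counts (88 of 1,013) come from the text. The choice of the Wald interval and the helper name `wald_prevalence` are assumptions made for this illustration, since the interval method is not stated.

```python
import math

def wald_prevalence(positives, tested, z=1.96):
    """Return (prevalence %, half-width %) of a Wald interval.

    Assumption: normal approximation to the binomial; the text does not
    say which interval method was used.
    """
    p = positives / tested
    half_width = z * math.sqrt(p * (1 - p) / tested)
    return 100 * p, 100 * half_width

# Overall counts reported in the text: 88 positive bats out of 1,013 tested.
prevalence, half_width = wald_prevalence(88, 1013)
print(f"{prevalence:.1f}% ± {half_width:.1f}%")
```

The same function can be applied to any location or family once the underlying counts are available (for example from the supplementary tables).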


RdRp sequence diversity 

Of the 88 positive samples, we obtained 77 partial RdRp sequences using the Real-Time 
PCR detection system (179 bp) and 51 longer partial RdRp sequences using a second PCR 
system (440 bp). Sequences generated with the second system were subsequently used for
phylogenetic analyses. Details of the sequenced CoV-positive samples are provided in
Supplementary Table S1. Pairwise comparison of these 51 sequences revealed 28 unique sequences,
with sequence similarities ranging from 60.2% to 99.8%. The lowest sequence similarity was found
in Mozambique (60.2% to 99.8%), then in Madagascar (64.0% to 99.8%). No genetic variation
was observed for samples from Mayotte and Reunion Island.


Phylogenetic structure of CoVs 

Sequence comparison indicated that Western Indian Ocean bats harbor a high diversity
of both α- and β-CoVs, with conserved groups clustering mostly by bat family (Figure 3).
Specifically, 25 sequences were identified as α-CoVs, and three sequences were genetically related
to the β-CoVs. For α-CoVs, all sequences detected in our tested Molossidae formed a highly
supported monophyletic group, including CoV sequences from Molossidae bats previously
detected in continental Africa (Figure 4). CoVs detected in Mops condylurus (Mozambique),
Mormopterus francoismoutoui (Reunion Island), Chaerephon pusillus and Chaerephon sp.
(Mayotte), and Mormopterus jugularis (Madagascar) shared 90% to 98% nucleotide similarity with
a CoV detected in Chaerephon sp. in Kenya (Supplementary Table S2). All CoVs found in
Miniopteridae clustered in a monophyletic group, including Miniopteridae CoV sequences
from Africa, Asia, and Oceania (Supplementary Table S2). The great majority of α-CoVs
detected in Rhinolophidae bats clustered in two monophyletic groups (Figure 3): one with African
Rhinolophidae CoVs and one with Asian Rhinolophidae CoVs. We also detected one CoV from
Rhinolophus rhodesiae, which was 100% similar to a Miniopteridae CoV from this study.
Rhinonycteridae CoVs formed a single monophyletic group with NL63 Human CoVs. The
Rhinonycteridae CoVs detected clustered with NL63-related bat sequences found in Triaenops afer
in Kenya (Figure 5) and showed 85% similarity to NL63 Human CoVs (Supplementary Table
S2). Hipposideridae α-CoVs mainly clustered into a single monophyletic group, including a 229E
Human CoV-related bat sequence found in Hipposideros vittatus from Kenya (Figure 6;
Supplementary Table S2).

For β-CoVs, two sequences obtained from Nycteris thebaica clustered in the C-subgroup
together with other CoVs previously reported in African Nycteris sp. bats (Figure 7). The
sequences showed 88% nucleotide identity to a β-C CoV found in Nycteris gambiensis in Ghana
(Supplementary Table S2). Rousettus madagascariensis CoVs clustered with Pteropodidae
CoVs belonging to the D-subgroup of β-CoVs (Figure 8). BLAST queries against the NCBI
database showed 98% nucleotide identity between CoV sequences from Rousettus
madagascariensis and a β-D CoV sequence detected in Eidolon helvum from Kenya (Supplementary
Table S2).


Co-phylogeny between bats and CoVs 
Co-phylogeny tests were conducted using 11 Cyt b sequences obtained from the 11
CoV-positive bat species and 27 partial CoV RdRp sequences (440 bp). Results supported
co-evolution between the Western Indian Ocean bats and their CoVs (ParaFitGlobal = 0.04;
p = 0.001) and a high level of phylogenetic congruence (Figure 9).


Discussion 


We provide evidence for a high diversity of CoVs in bats on Western Indian Ocean islands.
The overall prevalence of CoV-positive bats was consistent with studies from continental Africa
25 and from islands in the Australasian region, although we detected significant variation in
the prevalence of infected bats, according to their family, species, sampling location and season.
Our study is nevertheless affected by the strong heterogeneity of bat communities on the islands
of the Western Indian Ocean, in particular in terms of species richness. The high CoV genetic
diversity detected in bats from Mozambique and Madagascar is likely to be associated with the
higher bat species diversity on the African mainland and in Madagascar, as compared to small
oceanic islands. In addition, CoV prevalence in bat populations may vary significantly across
seasons, as found in Mozambique with higher prevalence during the wet season than in the dry
season. Several studies on bat CoVs have indeed shown significant variations in the temporal
infection dynamics of CoVs in bats, potentially associated with bat parturition.

Host specificity is well known for some bat CoV subgenera 35,37. For example, β-C CoVs
are largely associated with Vespertilionidae, whereas β-D CoVs are found mostly in
Pteropodidae. In our study, we showed that Western Indian Ocean bats harbor phylogenetically
structured CoVs, of both α-CoV and β-CoV subclades, clustering mostly by bat family. In the new
CoV taxonomy based on full genomes proposed by the International Committee on Taxonomy
of Viruses (ICTV), α-CoVs and β-CoVs are split into subgenera mostly based on host families,
as reflected in the subgenera names (e.g. Rhinacovirus for a Rhinolophidae α-CoV cluster,
Minunacovirus for a Miniopteridae α-CoV cluster, Hibecovirus for a Hipposideridae β-CoV
cluster). Although our classification was based on a partial sequence of the RdRp region, we
identified sequences from samples belonging to four of these subgenera (Minunacovirus,
Duvinacovirus, Rhinacovirus, and Nobecovirus) and three that could not be classified according to
this taxonomic scheme, hence representing unclassified subgenera (we propose “Molacovirus”,
“Nycbecovirus”, and “Rhinacovirus2”).

A strong geographical influence on CoV diversity, with independent evolution of CoVs on
each island, was expected in our study, because of spatial isolation and endemism of the tested
bat species. Anthony et al. 38 found that the dominant evolutionary mechanism for African CoVs
was host switching. Congruence between host and viral phylogenies, however, suggests a strong
signal for co-evolution between Western Indian Ocean bats and their associated CoVs.
Geographical influence seems to occur within bat families, as for Molossidae. Endemism resulting
from geographic isolation may thus have played a role in viral diversification within bat
families.

Although co-evolution could be the dominant mechanism in the Western Indian Ocean,
host-switching may take place in certain situations. For example, in Mozambique, we found a
potential Miniopteridae α-CoV in a Rhinolophidae bat co-roosting with Miniopteridae in the
same cave. These host-switching events could be favored when several bat species roost in
syntopy. A similar scenario was described in Australia where Miniopteridae α-CoV was
detected in Rhinolophidae bats. These infrequent host-switching events show that spillovers
can happen but suggest that viral transmission is not maintained in the receiver host species.
The host-virus co-evolution might thus have resulted in strong adaptation of CoVs to each bat
host species. In addition, viral factors (mutation rate, recombination propensity, replication
ability in the cytoplasm, changes in the ability to bind host cells), environmental factors (climate
variation, habitat degradation, decrease in bat prey), and phylogenetic relatedness of host
species are also critical for the viral establishment in a novel host. Nevertheless, apparent
evidence of host switching as a dominant mechanism of CoV evolution could be an artifact of
a lack of data for some potential bat hosts, leading to incomplete phylogenetic reconstructions
38.

Several bat CoVs we identified in Rhinonycteridae and Hipposideridae from Mozambique
had between 85% and 93% nucleotide sequence similarity with NL63 Human CoVs and 229E
Human CoVs, respectively. These two human viruses are widely distributed in the world and
associated with mild to moderate respiratory infection in humans. Tao et al. established that
the NL63 Human CoVs and 229E Human CoVs have a zoonotic recombinant origin from their
most recent common ancestor, estimated to be about 1,000 years ago. During the past decade,
NL63-related and 229E-related CoVs were detected in bats in Kenya, and in Ghana, Gabon,
Kenya, and Zimbabwe, respectively. Intermediate hosts are important in the spillover of CoVs,
despite major knowledge gaps on the transmission routes of bat infectious agents to secondary
hosts. This hypothesis has been formulated for the 229E Human CoV, with an evolutionary origin
in Hipposideridae bats and with camelids as intermediate hosts. The ancient spillover of NL63
from Rhinonycteridae bats to humans might have occurred through a currently unidentified
intermediate host. Because receptor recognition by viruses is the first essential cellular step to
infect host cells, CoVs may have spilt over into humans from bats through an intermediate host
possibly due to mutations on spike genes. Further investigations of CoVs in Kenyan and
Mozambican livestock and hunted animals could potentially provide information on the
complete evolutionary and emergence history of these two viruses before their establishment in
humans.

MERS-like CoVs, with high sequence similarity (>85%) to human and camel strains of
MERS-CoV, have been detected in Neoromicia capensis in South Africa and Pipistrellus cf.
hesperidus in Uganda, suggesting a possible origin of camel MERS-CoV in vespertilionid bats
25,38,52. This family has been widely studied, with 30% of all reported bat CoV sequences from
the past 20 years coming from vespertilionids. No members of this family were positive for
CoV in our study, which may be associated with the low number of individuals tested;
additional material is needed to explore potential MERS-like CoVs in the Western Indian Ocean, in
particular on Madagascar.

Knowledge on bat CoV ecology and epidemiology has significantly increased during the 
past decade. Anthony et al. estimated that there might be at least 3,204 bat CoVs worldwide 38; 
however, direct bat-to-human transmission has not been demonstrated so far. As for most 
emerging zoonoses, CoV spillover and emergence may be associated with human activities and
ecosystem changes such as habitat fragmentation, agricultural intensification and bushmeat
consumption. The role of bats as epidemiological reservoirs of infectious agents needs to be
balanced with such human-driven modifications of ecosystem functioning, in order to properly
assess bat-borne CoV emergence risks.


Materials and methods 


Origin of the tested samples 


Samples obtained from vouchered bat specimens during previous studies in Mozambique
(February and May 2015), Mayotte (November to December 2014), Reunion Island (February
2015), Seychelles (February to March 2014), Mauritius (November 2012) and Madagascar
(October to November 2014) were tested (Supplementary Information). We also collected
additional swab samples from several synanthropic bat species on Madagascar, in January
2018 (Supplementary Information). Details on sample types, bat families, species, and
locations are provided in Supplementary Table S3.


Ethical statement 

The ethical terms of these research protocols were approved by the CYROI Institutional
Animal Care and Use Committee (Comité d’Ethique du CYROI no. 114, IACUC certified
by the French Ministry of Higher Education, of Research and Innovation). All protocols
strictly followed the terms of research permits and regulations for the handling of wild
mammals and were approved by licensing authorities (Supplementary Information).


Molecular detection 


RNA was extracted from 140 µL of each sample using the QIAamp Viral RNA mini kit
(QIAGEN, Valencia, California, USA), and eluted in 60 µL of Qiagen AVE elution buffer. For
bat organs, approximately 1 mm³ of tissue (either lungs or intestines) was placed in 750 µL of
DMEM medium and homogenized in a TissueLyser II (Qiagen, Hilden, Germany) for 2 min at
25 Hz using 3 mm tungsten beads, prior to the RNA extraction. Reverse transcription was
performed on 10 µL of RNA using the ProtoScript II Reverse Transcriptase and Random Primer 6
(New England BioLabs, Ipswich, MA, USA) under the following thermal conditions: 70 °C for
5 min, 25 °C for 10 min, 42 °C for 50 min, and 65 °C for 20 min 58. cDNAs were tested for the
presence of the RNA-dependent RNA-polymerase (RdRp) gene using a multi-probe Real-Time
PCR. The primer set with Locked Nucleic Acids (LNA; underlined position in probe
sequences) was purchased from Eurogentec (Seraing, Belgium): 11-FW:
5'-TGA-TGA-TGS-NGT-TGT-NTG-YTA-YAA-3' and 13-RV:
5'-GCA-TWG-TRT-GYT-GNG-ARC-ARA-ATT-C-3'. Three probes were used: probe I (ROX):
5'-TTG-TAT-TAT-CAG-AAT-GGY-GTS-TTY-AT-3', probe II (FAM):
5'-TGT-GTT-CAT-GTC-WGA-RGC-WAA-ATG-TT-3', and probe III (HEX):
5'-TCT-AAR-TGT-TGG-GTD-GA-3'. Real-Time PCR was performed
with ABsolute Blue QPCR Mix low ROX 1X (Thermo Fisher Scientific, Waltham, MA, USA)
and 2.5 µL of cDNA under the following thermal conditions: 95 °C for 15 min, 95 °C for 30 s,
touchdowns from 56 °C to 50 °C for 1 min and 50 cycles with 95 °C for 30 s and 50 °C for 1
min in a CFX96 Touch Real-Time PCR Detection System (Bio-Rad, Hercules, CA, USA).

Because of the limited size of sequences generated from the Real-Time PCR, a second
PCR targeting 440 bp of the RdRp gene was performed with 5 µL of cDNA of each positive
sample, with the following primer set: IN-6: 5'-GGT-TGG-GAC-TAT-CCT-AAG-TGT-GA-3'
and IN-7: 5'-CCA-TCA-TCA-GAT-AGA-ATC-ATC-ATA-3'. PCRs were performed
with the GoTaq G2 Hot Start Green Master Mix (Promega, Madison, WI, USA) in an Applied
Biosystems 2720 Thermal Cycler (Thermo Fisher Scientific, Waltham, MA, USA), under the
following thermal conditions: 95 °C for 2 min, 45 cycles with 95 °C for 1 min, 54 °C for 1 min,
72 °C for 1 min, and a final elongation step at 72 °C for 10 min. After electrophoresis in a 1.5%
agarose gel stained with 2% GelRed (Biotium, Hayward, CA, USA), amplicons of the expected
size were sequenced on both strands by Genoscreen (Lille, France). All generated sequences
were deposited in GenBank under the accession numbers MN183146 to MN183273.
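The Real-Time PCR primers listed above contain IUPAC ambiguity codes (S, N, Y, W, R), so each primer name stands for a pool of sequences. The sketch below, offered only as an illustration, counts how many distinct sequences each primer represents. The helper name `degeneracy` is hypothetical, and the LNA modifications mentioned in the text are not modelled.

```python
# Number of bases encoded by each IUPAC nucleotide code.
IUPAC_DEGENERACY = {
    "A": 1, "C": 1, "G": 1, "T": 1,
    "R": 2, "Y": 2, "S": 2, "W": 2, "K": 2, "M": 2,
    "B": 3, "D": 3, "H": 3, "V": 3,
    "N": 4,
}

def degeneracy(primer):
    """Count the distinct sequences a degenerate primer represents."""
    bases = primer.replace("-", "").upper()
    total = 1
    for base in bases:
        total *= IUPAC_DEGENERACY[base]
    return total

# Primer sequences as listed in the text (5' to 3').
primers = {
    "11-FW": "TGA-TGA-TGS-NGT-TGT-NTG-YTA-YAA",
    "13-RV": "GCA-TWG-TRT-GYT-GNG-ARC-ARA-ATT-C",
    "IN-6": "GGT-TGG-GAC-TAT-CCT-AAG-TGT-GA",
    "IN-7": "CCA-TCA-TCA-GAT-AGA-ATC-ATC-ATA",
}
for name, seq in primers.items():
    length = len(seq.replace("-", ""))
    print(f"{name}: {length} nt, degeneracy {degeneracy(seq)}")
```

The two screening primers each expand to 128 sequences, whereas the IN-6/IN-7 pair used for the longer amplicon is not degenerate.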


Statistical analysis 

We performed Pearson χ² tests on all samples (1,013 bats) to explore the effect of
(i) location, (ii) bat family, and (iii) bat species on the detection of coronavirus RNA. Two
sampling campaigns, in two different seasons, at the same location, were available for
Mozambique. We thus investigated the effect of the sampling season, between the wet (February)
and dry (May) seasons, on CoV detection in Mozambique in 2015 (264 bats). Analyses were
conducted with R v3.5.1 software.
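The seasonal comparison described above is a test on a 2 × 2 table (season by CoV status). The sketch below shows how such a Pearson χ² statistic can be computed with Yates' continuity correction, which is the default for 2 × 2 tables in R's `chisq.test`. The counts are assumptions: they are back-calculated from the reported prevalences (37.4% and 11.6%) and N = 264, and are not given in the text.

```python
def chi2_2x2_yates(table):
    """Pearson chi-squared statistic for a 2x2 table with Yates' correction."""
    (a, b), (c, d) = table
    n = a + b + c + d
    rows = (a + b, c + d)
    cols = (a + c, b + d)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = rows[i] * cols[j] / n
            # Yates: shrink each |O - E| by 0.5 (never below zero).
            diff = max(abs(observed - expected) - 0.5, 0.0)
            stat += diff ** 2 / expected
    return stat

# Hypothetical counts back-calculated from the reported prevalences
# (37.4% in February, 11.6% in May, N = 264); not given in the text.
season_table = [
    [34, 57],   # February: positive, negative (n = 91)
    [20, 153],  # May: positive, negative (n = 173)
]
chi2 = chi2_2x2_yates(season_table)
print(f"chi-squared = {chi2:.1f}, df = 1")
```

Under these assumed counts the statistic is close to the reported value of 22.8, which suggests, but does not prove, that the continuity correction was applied.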


Phylogenetic analyses 

Sequences obtained with the second PCR system were edited with the Chromas Lite
Software package version 2.6.4. We explored CoV diversity of the sequences with pairwise
identity values obtained from the seqidentity function in the R bio3d package v2.3-4 and identified
the most similar CoV RdRp sequences referenced in GenBank using BLASTN 2.2.29+. An
alignment was then generated using the 51 nucleotide sequences obtained in this study and 151
reference CoV sequences representing a large diversity of hosts and geographic origins
(Europe, Asia, Oceania, America and Africa), with CLC Sequence Viewer 8.0 Software (CLC Bio,
Aarhus, Denmark). A phylogenetic tree was obtained by maximum likelihood using MEGA
Software v10.0.4, with 1,000 bootstrap iterations, and with the best evolutionary model for
our dataset as selected by modelgenerator v0.85.
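To make the pairwise identity step concrete, here is a minimal sketch of percent identity between aligned sequences and of counting unique sequences. It is not the bio3d implementation: the function name `percent_identity`, the rule of skipping gapped columns, and the toy sequences are all assumptions made for illustration.

```python
from itertools import combinations

def percent_identity(seq_a, seq_b):
    """Percent identity between two aligned, equal-length sequences.

    Columns where either sequence has a gap ('-') are skipped; this is a
    simplification and may differ from how bio3d handles gaps.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        matches += a == b
    return 100 * matches / compared if compared else 0.0

# Toy aligned fragments (hypothetical, not sequences from this study).
alignment = {
    "seq1": "ATGGCTTACGGA",
    "seq2": "ATGGCTTACGGA",
    "seq3": "ATGACTTAC-GA",
}
for (name_a, a), (name_b, b) in combinations(alignment.items(), 2):
    print(name_a, name_b, f"{percent_identity(a, b):.1f}%")

# Identical sequences collapse to one entry when counting unique sequences.
unique = set(alignment.values())
print(len(unique), "unique sequences")
```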

Host-virus associations were investigated using the phylogeny of Western Indian Ocean 
bats and their associated CoVs. Bat phylogeny was generated from an alignment of 1,030 bp of 
mitochondrial Cytochrome b (Cyt b) gene sequences (Supplementary Table S4), for each CoV 
positive bat species. Finally, bat and pruned CoV phylogenies based on each 393 bp RdRp 
unique sequence fragment were generated by Neighbor-Joining with 1,000 bootstrap iterations, 
using CLC Sequence Viewer 8.0 Software (CLC Bio, Aarhus, Denmark). Phylogenetic
congruence was tested to assess the significance of the coevolutionary signal between bat host
species and CoV sequences, using ParaFit with 999 permutations in the ape package v5.0 in
R 3.5.1. Tanglegram representations of the co-phylogeny were visualized using the Jane
software v4.01.
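A note on reading a permutation-based p-value such as the one from the congruence test above: a common convention counts the observed statistic as one member of the null distribution, so the p-value is (1 + number of permuted statistics at least as large as the observed one) / (1 + number of permutations). Assuming this convention (it has not been checked here against the ape implementation), 999 permutations cannot yield a p-value below 0.001. The sketch uses a made-up null distribution purely for illustration.

```python
import random

def permutation_p_value(observed, permuted_stats):
    """One-sided permutation p-value, counting the observed value itself."""
    at_least_as_large = sum(stat >= observed for stat in permuted_stats)
    return (1 + at_least_as_large) / (1 + len(permuted_stats))

# Hypothetical null distribution: 999 permuted statistics, all smaller than
# the observed global statistic (0.04, as reported in the text).
rng = random.Random(0)
null_stats = [rng.uniform(0.0, 0.03) for _ in range(999)]
p_value = permutation_p_value(0.04, null_stats)
print(p_value)
```

Under this convention, a reported p = 0.001 with 999 permutations means that no permuted data set reached the observed statistic.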
Figure 1. Geographic distribution of the tested samples. N: number of bats sampled for each
location. The open-source GIS software, QGIS v.3.6.1, was used to generate the map.
http://qgis.osgeo.org (2019).

Figure 3. Maximum Likelihood (ML) consensus tree derived from 202 coronavirus (CoV)
RNA-dependent RNA-polymerase partial nucleotide sequences (393 bp). Colored circles at the
end of branches indicate bat family origin. Sequences in bold refer to bat CoVs detected in this
study. Bootstrap values >0.7 are indicated on the tree. Scale bar indicates mean number of
nucleotide substitutions per site. The tree was generated with the General Time Reversible
evolutionary model (GTR+I+Γ, I = 0.18, α = 0.64) and 1,000 bootstrap replicates.


[Tree image. Panel: Alpha CoV, Molossidae. Sequences from this study: MN183173 to MN183188. Scale bar: 0.05.]

Figure 4. Detail of the α-CoV clade. Molossidae CoVs generated in the study are indicated in
bold. This sub-tree is a zoom on the Molossidae CoV clade from the tree depicted in Figure 3.
Bootstrap values >0.7 are indicated on the tree. Scale bar indicates mean number of nucleotide
substitutions per site.

[Tree image. Panel: Alpha CoV, Rhinonycteridae. Sequences from this study: MN183162 to MN183169.]

Figure 5. Detail of the α-CoV clade. NL63-like CoVs generated in the study are indicated in
bold. This sub-tree is a zoom on the NL63 CoV clade from the tree depicted in Figure 3. Only
bootstrap values >0.7 are indicated on the tree. Scale bar indicates mean number of nucleotide
substitutions per site.


[Tree image. Panel: Alpha CoV, Hipposideridae. Sequences from this study: MN183170 to MN183172.]

Figure 6. Detail of the α-CoV clade. 229E-like CoVs generated in the study are indicated in
bold. This sub-tree is a zoom on the 229E CoV clade from the tree depicted in Figure 3. Bootstrap
values >0.7 are indicated on the tree. Scale bar indicates mean number of nucleotide
substitutions per site.

[Tree image. Panel: Beta-C CoV; Vespertilionidae, Nycteridae. Sequences from this study: MN183193, MN183195, MN183196.]

Figure 7. Detail of the β-C CoV clade. CoVs generated in the study are indicated in bold. This
sub-tree is a zoom on the β-C CoV clade from the tree depicted in Figure 3. Bootstrap values >0.7
are indicated on the tree. Scale bar indicates mean number of nucleotide substitutions per site.


[Tree image. Panel: Beta-D CoV, Pteropodidae. Sequences from this study: MN183191, MN183192.]

Figure 8. Detail of the β-D CoV clade. CoVs generated in the study are indicated in bold. This
sub-tree is a zoom on the β-D CoV clade from the tree depicted in Figure 3. Bootstrap values >0.7
are indicated on the tree. Scale bar indicates mean number of nucleotide substitutions per site.

Figure 9. Tanglegram representing host-virus co-evolution between bats of the Western Indian
Ocean and their associated CoVs. Phylogeny of bats (left) was constructed with an alignment
of 11 Cyt b sequences of 1,030 bp by Neighbor-Joining with 1,000 bootstrap iterations. Pruned
phylogeny of Western Indian Ocean bat CoVs (right) was constructed with an alignment of 27
unique sequences of 393 bp from Western Indian Ocean bat CoVs, by Neighbor-Joining with
1,000 bootstrap iterations.